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Sir: 

This Supplemental Appeal Brief is submitted in response to the Office Action dated May 9, 
2003 in the above-referenced application, in which the Examiner reopened prosecution in response 
to the Appeal Brief filed February 12, 2003. 

Applicant (hereinafter referred to as "Appellant") has submitted concurrently herewith a 
response to the Office Action, requesting reinstatement of the appeal pursuant to 37 C.F.R. 
§1. 193(b)(2). 

REAL PARTY IN INTEREST 
The present application is assigned to International Business Machines Corporation, as o» 
evidenced by an assignment recorded May 7, 1999 in the U.S. Patent and Trademark Office at Reel 
9946, Frame 0742. The assignee, International Business Machines Corporation, is the real party in 
interest. 

RELATED APPEALS AND INTERFERENCES 
There are no known related appeals and interferences. 
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STATUS OF CLAIMS 
Claims 1-4, 12-15, 23, 24 and 26 stand rejected under 35 U.S.C. §102(e). Claims 5-11, 16- 
22, 25 and 27 stand rejected under 35 U.S.C. §103(a). Claims 1-27 are appealed. 

STATUS OF AMENDMENTS 
There has been no amendment filed subsequent to the final rejection. However, a Response 
to Final Office Action was filed on December 12, 2002, along with the Notice of Appeal. 

SUMMARY OF INVENTION 

The present invention provides techniques that may aid in decision support applications by 
automatically selecting data relevant to an analysis. This may be accomplished by using the structure 
of the source dataset in combination with the content of the source element collection to identify the 
closest element collections within one or more target datasets (Specification, page 7, lines 23-27). 

For example, the present invention, as recited in independent claim 1, defines a method of 
automating navigation between data with dissimilar structures including a source dataset containing 
one or more data elements and at least one target dataset containing one or more data elements. The 
method comprises the steps of: (i) determining at least one collection of data elements fi-om the at 
least one target dataset that best matches a collection of data elements from the source dataset based 
on structures associated with the source dataset and the target dataset; and (ii) computing at least 
one distance metric between the at least one target collection and the source collection such that a 
user can select the at least one target collection given the at least one computed distance metric. 

The invention may be implemented in a form which includes functional components as 
shown in FIG. 1 A (Specification, page 12, line 12, through page 13, line 19). A first component is 
referred to as an inter-dataset navigation engine (IDNE) 140. The IDNE is invoked by analysis 
applications 1 10-1 through 1 10-N to automate the selection of related data. The IDNE makes use 
of another component referred to as dataset access services 130. The dataset access services 
component knows the accessible datasets and their structures, creates and manipulates collection 
descriptors, and provides access to elements within a dataset that are specified by a collection 
descriptor. 
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By way of example, in one embodiment, automated navigation according to the invention 
may be accomplished in the following manner. First, the IDNE 1 40 iterates across all target datasets 
to do the following: (a) use the structure of the source and target datasets to transform the source 
collection descriptor into a preliminary collection descriptor for the subset of the target dataset that 
is closest to the source element collection; (b) construct the final collection descriptor by 
transforming the preliminary collection descriptor until it specifies a non-null subset of the target 
dataset; and (c) compute a distance metric representing how close the source element collection (or 
collection descriptor) is to the target element collection (or collection descriptor). The IDNE 140 
then returns a list of triples including a name of the target dataset, a target collection descriptor, and 
a value of the distance metric for each target dataset (Specification, page 8, lines 9-19). 

A flow diagram illustrating an automated navigation method according to an exemplary 
embodiment of the invention is shown in FIG. 2 (Specification, page 1 4, line 8, through page 1 5, line 
13). Further, a flow diagram illustrating a technique for computing a target collection descriptor that 
best matches a source collection descriptor according to an exemplary embodiment of the invention 
is shown in FIG. 3 (Specification, page 15, line 14, through page 17, line 14). Still fiirther, a flow 
diagram illustrating a technique for computing a distance metric according to an exemplary 
embodiment of the invention is shown in FIG. 4 (Specification, page 17, line 15, through page 18, 
line 10). Lastly, a diagram illustrating a graphical user interface for presenting exemplary results 
associated with automatic navigation according to an exemplary embodiment of the invention is 
shown in FIG. 5 (Specification, page 18, lines 11-19). 

Accordingly, the present invention may provide automation for selecting datasets relevant 
to analysis tasks. Such automation is crucial to improving the productivity of decision support in 
systems management applications. The automation enabled by the invention provides value in many 
ways. For example, the invention makes the novice analyst more expert by providing a list of target 
datasets and collection descriptors that are closest to an element collection at hand (i.e., the source 
element collection). As a result, the novice focuses on the datasets that are most likely to be of 
interest in the analysis task. By way of further example, the invention makes expert analysis more 
productive. This is achieved by providing the target collection descriptor for each target dataset 
thereby enabling the construction of a system in which analysts need only click on a target dataset 
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(or collection descriptor) in order to navigate to its associated element collection (Specification, page 
9, line 24, through page 10, line 6). 

Automated navigation according to the invention may be applied in many domains. For 
example, in analysis of manufacturing lines, measurement datasets exist for machines in the 
manufacturing line as well as for the interconnection of these machines. Automated navigation 
according to the invention can aid with decision support for scheduling and planning based on this 
data. By way of a further example, in transportation systems, datasets exist for measurements taken 
by road sensors and traffic reports. Automated navigation according to the invention can aid in 
planning highway capacity over the entire network of roadways. It is to be understood that the above 
applications are merely exemplary in nature and not intended to limit the applicability of the 
invention. Furthermore, it is to be appreciated that automated navigation according to the invention 
can be accomplished centrally at a server or in a distributed manner amongst several smaller server 
machines (Specification, page 10, line 22, through page 1 1, line 4). 

ISSUES PRESENTED FOR REVIEW 

(I) Whether claims 1-4, 12-15, 23, 24 and 26 areproperly rejected under 35 U.S.C. §102(e) 
as being anticipated by U.S. Patent No. 6,828,3 1 8 to Dietrich et al. (hereinafter "Dietrich"); and 

(II) Whether claims 5-11,1 6-22, 25 and 27 are properly rejected under 35 U.S.C. § 103(a) as 
being unpatentable over Dietrich in view of U.S. Patent No. 5,970,490 to Morgenstem et al. 
(hereinafter "Morgenstem"). 

GROUPING OF CLAIMS 
Claims 1-27 stand or fall together. 

ARGUMENT 

Appellant incorporates by reference herein the disclosure of all previous responses filed in 
the present application, namely, responses dated: May 2, 2001 ; November 30, 2001 ; June 28, 2002; 
December 1 2, 2002; and February 1 2, 2003 . Sections (I) and (II) to follow will respectively address 
issues (I) and (II) presented above. 
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(I) With regard to the issue of whether claims 1 -4, 1 2- 1 5, 23, 24 and 26 are properly rejected 
under 35 U.S.C. § 102(e) as being anticipated by Dietrich, the Office Action contends that Dietrich 
discloses all of the claim limitations recited in the subject claims. Appellant respectfully asserts that 
Dietrich fails to disclose all the claim limitations of the subject claims. 

As stated above, the present invention, for example as recited in independent claim 1 , defines 
a method of automating navigation betv^een data v^^ith dissim ilar structures including a source dataset 
containing one or more data elements and at least one target dataset containing one or more data 
elements. The method comprises the steps of: (i) determining at least one collection of data elements 
from the at least one target dataset that best matches a collection of data elements from the source 
dataset based on structures associated with the source d ataset and the target dataset: and (ii) 
computing at least one distance metric between the at least one target collection and the source 
collection such that a user can select the at least one target collection given the at least one computed 
distance metric. The independent claim had previously been amended in Appellant's response dated 
June 28, 2002 to further clarify that the determination step is based on stru ctures associated with the 
source dataset and the target dataset . Independent claim 12 defines a similar apparatus-based 
invention, while independent claim 23 defines a similar article of manufacture-based invention. 
Independent claims 24 and 26 recite other embodiments of such automated navigation techniques. 
Each independent claim recites that the best match determination is based on structures associated 
with the source dataset and at least one target dataset . 

On the other hand, Dietrich discloses a method for matching the elements of two data files. 
More specifically, the method reads data elements and corresponding attributes in the data files, 
before performing pattern matching on the elements and attributes. A best matching of the elements 
is then performed. Dietrich suggests that the method is especially beneficial for reconciling two or 
more databases that may contain data related to the same set of items. Examples of such data 
reconciliation include merging mailing lists and merging lists of people. 

Appellant asserts that Dietrich fails to teach all of the claim limitations of independent claims 
1 , 1 2, 23, 24 and 26. The inventive steps (or operations) comprise determining at least one collection 
of data elements from at least one target dataset which best matches a collection of data elements 
from a source dataset based on structures associated with the source dataset and at least one target 
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dataset; and then computing at least one distance metric between the target collection and the source 
collection such that the user can select the target collection. As explained above, the claimed 
invention is directed to techniques for automating navigation between data with dissimilar structures 
which comprises determining at least one collection of data elements from the at least one target 
dataset that best matches a collection of data elements from the source dataset based on structures 
associated with the source dataset and the target dataset , 

Dietrich, as is evident, considers items based on the substantive content in the items , not 
based on structures associated with the items , as in the claimed invention. That is, for example, 
Dietrich determines the measure of similarity of each element in one set with each element in another 
set (col 3, lines 46-50). The similarity measure corresponds to some aspect or attribute of the data 
associated with the elements. The latest Office Action refers to col. 1, lines 16-26, in anticipating 
claim 1 of the present invention. This section refers to a manufacturing process example in which 
two databases to be reconciled refer to a specific part with different part numbers. The attributes of 
the part to be considered in the similarity measure may include size, color or speed (col. 3, lines 54- 
56). Thus, Dietrich does not account for structural dissimilarity (e.g., format), as does the claimed 
invention, but rather, accounts for substantive similarity (e.g., the specific attributes of the part, size 
color, or speed). Therefore, Dietrich fails to disclose determining a best match based on structures 
associated with the source dataset and the target dataset , as in the claimed invention. This is a 
fundamental difference between the two approaches. 

Furthermore, Dietrich also fails to perform a distance metric computation after a 
determination of a best match, as recited by the claimed invention. The latest Office Action contends 
that Dietrich's similarity measure anticipates this step. While Dietrich lists several similarity 
measures that may be used in its similarity operation (col. 2, lines 54-57), these measures assess the 
relevant attributes in order to determine the best match, as described above. Dietrich does not 
disclose a separate distance measure computation between target and source collections after a 
determination of best match. This is another fixndamental difference between Dietrich and 
independent claims 1, 12, 23, 24 and 26. 

For a clear example of what type of data problem that the invention may provide a solution 
for with respect to dissimilar data structures, see the example provided in the context of QoS (quality 
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of service) management at page 3, line 1, to page 4, line 27, of the present specification. This is 
significantly different than the substantive similarity problem that Dietrich attempts to address. 
Thus, Dietrich fails to disclose all of the limitations of independent claims 1,12, 23, 24 and 

26. 

For at least the reasons given above, Appellant respectfully requests withdrawal of the 
^ §102(e) rejections of independent claims 1, 12, 23, 24 and 26. Further, not only due to their 
respective dependence on such independent claims but also because such claims recite patentable 
subject matter in their own right, Appellant respectfully requests withdrawal of the § 1 02(e) rejections 
of dependent claims 2-4 and 13-15. 

By way of example, Dietrich fails to teach or suggest the following claimed features: (i) 
wherein there is a plurality of target datasets from which respective best matching target collections 
are determined and respective distance metrics are computed such that the computed distance metrics 
are presented to the user in a ranked order (claims 2 and 13); (ii) wherein the presenting step further 
includes presenting the respective target collection to the user along with the respective computed 
distance metric (claims 3 and 14); and (iii) wherein the presenting step further includes presenting 
a respective name associated with the target dataset to the user along with the respective target 
collection and the computed distance metric (claims 4 and 15). 

(II) With regard to the issue ofwhether claims 5-11, 16-22,25 and 27 are unpatentable under 
35 U.S.C. § 103(a) over Dietrich in view of Morgenstem, the Office Action contends that the cited 
combination discloses all of the claim limitations recited in the subject claims. Appellant asserts that 
the combination of Dietrich and Morganstem fails to teach or suggest all of the claim limitations 
recited in the subject claims and, therefore, fails to establish a prima facie case of obviousness under 
35 U.S.C. §103(a), as specified in M.P.E.P. §2143. Thus, Appellant respectfully traverses the 
§103(a) rejection of claims 5-11, 16-22, 25 and 27 for at least the following reasons. 

Appellant hereby alleges and incorporates by reference the arguments relating to claims 1, 
12, 23, 24 and 26 above in their entirety. Accordingly, due at least to the fact that claims 5-11,16- 
22, 25 and 27 respectively depend from independent claims 1 , 1 2, 24 and 26, it is believed that such 
dependent claims are allowable for at least the reasons identified above and, therefore, Appellant 
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requests withdrawal of the § 103(a) rejections. That is, Morgenstem fails to remedy the deficiencies 
described above with respect to Dietrich. Further, Appellant asserts that claims 5-11, 16-22, 25 and 
27 recite patentable subject matter in their own right. 

By way of example, the combination of Dietrich and Morgenstem fails to teach or suggest 
the following claimed features: (i) wherein the source collection of data elements is specified by a 
source collection descriptor and the at least one target collection of data elements is specified by a 
target collection descriptor (claims 5 and 16); (ii) wherein the data is organized in a relational 
database and further wherein the determining step includes deleting at least one attribute associated 
with the target collection descriptor that is not present in the source collection descriptor (claims 6 
and 17); (iii) wherein the data is organized in a multidimensional database and further wherein the 
determining step includes performing at least one drill-up operation on the target collection 
descriptor (claims 7 and 18); (iv) wherein the determining step further includes the steps of: 
generating at least one preliminary target collection descriptor associated with the at least one target 
collection by transforming a source collection descriptor associated with the source collection; and 
removing constraints associated with the at least one preliminary target collection descriptor until 
a non-null element collection is obtained (claims 8, 19, 25 and 27); (v) wherein the source collection 
of data elements is specified by a source collection descriptor and the at least one target collection 
of data elements is specified by a target collection descriptor and further wherein the computing step 
includes calculating the difference between constraints in the source collection descriptor and the 
target collection descriptor to compute the distance metric (claims 9 and 20); (vi) wherein attributes 
of the constraints are weighted in accordance with their importance (claims 10 and 21); and (vii) 
wherein the distance metric is proportionally larger when the source and target collection descriptors 
differ by an attribute of a constraint that has a heavier weight associated therewith (claims 1 1 and 
22). 
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For at least the foregoing reasons, claims 1-27 are believed to be patentable over the cited 
references. As such, the application is asserted to be in condition for allowance, and favorable action 
is respectfully solicited. 



Date: September 9, 2003 



Respectfully submitted, 

William E. Lewis 
Attorney for Applicant(s) 
Reg. No. 39,274 
Ryan, Mason & Lewis, LLP 
90 Forest Avenue 
Locust Valley, NY 11560 
(516) 759-2946 




APPENDIX 

1 . A method of automating navigation between data with dissimilar structures including a 
source dataset containing one or more data elements and at least one target dataset containing one 
or more data elements, the method comprising the steps of: 

determining at least one collection of data elements from the at least one target dataset that 
best matches a collection of data elements from the source dataset based on structures associated 
with the source dataset and the target dataset; and 

computing at least one distance metric between the at least one target collection and the 
source collection such that a user can select the at least one target collection given the at least one 
computed distance metric. 

2. The method of Claim 1, wherein there is a plurality of target datasets from which 
respective best matching target collections are determined and respective distance metrics are 
computed such that the computed distance metrics are presented to the user in a ranked order. 

3. The method of Claim 2, wherein the presenting step ftirther includes presenting the 
respective target collection to the user along with the respective computed distance metric. 

4. The method of Claim 3, wherein the presenting step ftirther includes presenting a 
respective name associated with the target dataset to the user along with the respective target 
collection and the computed distance metric. 

5. The method of Claim 1, wherein the source collection of data elements is specified by a 
source collection descriptor and the at least one target collection of data elements is specified by a 
target collection descriptor. 

6. The method of Claim 5, wherein the data is organized in a relational database and ftirther 
wherein the determining step includes deleting at least one attribute associated with the target 
collection descriptor that is not present in the source collection descriptor. 



10 



7. The method of Claim 5, wherein the data is organized in a multidimensional database and 
further wherein the determining step includes performing at least one drill-up operation on the target 
collection descriptor. 

8. The method of Claim 1, wherein the determining step further includes the steps of: 
generating at least one preliminary target collection descriptor associated with the at least one 

target collection by transforming a source collection descriptor associated with the source collection; 
and 

removing constraints associated with the at least one preliminary target collection descriptor 
until a non-null element collection is obtained. 

9. The method of Claim 1, wherein the source collection of data elements is specified by a 
source collection descriptor and the at least one target collection of data elements is specified by a 
target collection descriptor and further wherein the computing step includes calculating the 
difference between constraints in the source collection descriptor and the target collection descriptor 
to compute the distance metric. 

1 0. The method of Claim 9, wherein attributes of the constraints are weighted in accordance 
with their importance. 

11. The method of Claim 10, wherein the distance metric is proportionally larger when the 
source and target collection descriptors differ by an attribute of a constraint that has a heavier weight 
associated therewith. 

12. Apparatus for automating navigation between data with dissimilar structures including 
a source dataset containing one or more data elements and at least one target dataset containing one 
or more data elements, the apparatus comprising: 

at least one processor operable to determine at least one collection of data elements from the 
at least one target dataset that best matches a collection of data elements from the source dataset 
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based on structures associated with the source dataset and the target dataset, and to compute at least 
one distance metric between the at least one target collection and the source collection such that a 
user can select the at least one target collection given the at least one computed distance metric; and 
a memory coupled to the at least one processor for storing the at least one target dataset. 

13. The apparatus of Claim 12, wherein there is a plurality of target datasets from which 
respective best matching target collections are determined and respective distance metrics are 
computed such that the computed distance metrics are presented to the user in a ranked order. 

14. The apparatus of Claim 13, wherein the at least one processor is further operable to 
present the respective target collection to the user along with the respective computed distance 
metric. 

15. The apparatus of Claim 14, wherein the at least one processor is further operable to 
present a respective name associated with the target dataset to the user along with the respective 
target collection and the computed distance metric. 

16. The apparatus of Claim 12, wherein the source collection of data elements is specified 
by a source collection descriptor and the at least one target collection of data elements is specified 
by a target collection descriptor. 

17. The apparatus of Claim 16, wherein the data is organized in a relational database and 
further wherein the at least one processor is operable to perform the determining operation by 
deleting at least one attribute associated with the target collection descriptor that is not present in the 
source collection descriptor. 

18. The apparatus of Claim 1 6, wherein the data is organized in a multidimensional database 
and further wherein the at least one processor is operable to perform the determining operation by 
performing at least one drill-up operation on the target collection descriptor. 
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19. The apparatus of Claim 12, wherein the at least one processor is further operable to 
perform the determining operation by generating at least one preliminary target collection descriptor 
associated with the at least one target collection by transforming a source collection descriptor 
associated with the source collection, and removing constraints associated with the at least one 
preliminary target collection descriptor until a non-null element collection is obtained. 

20. The apparatus of Claim 12, wherein the source collection of data elements is specified 
by a source collection descriptor and the at least one target collection of data elements is specified 
by a target collection descriptor and further wherein the at least one processor is operable to perform 
the computing operation by calculating the difference between constraints in the source collection 
descriptor and the target collection descriptor to compute the distance metric. 

21. The apparatus of Claim 20, wherein attributes of the constraints are weighted in 
accordance with their importance. 

22. The apparatus of Claim 21, wherein the distance metric is proportionally larger when 
the source and target collection descriptors differ by an attribute of a constraint that has a heavier 
weight associated therewith. 

23. An article of manufacture for automating navigation between data with dissimilar 
structures including a source dataset containing one or more data elements and at least one target 
dataset containing one or more data elements, comprising a machine readable medium containing 
one or more programs which when executed implement the steps of: 

determining at least one collection of data elements from the at least one target dataset that 
best matches a collection of data elements from the source dataset based on structures associated 
with the source dataset and the target dataset; and 



13 



computing at least one distance metric between the at least one target collection and the 
source collection such that a user can select the at least one target collection given the at least one 
computed distance metric. 

24. A computer-based method of automatically navigating between data with dissimilar 
structures including a source dataset containing one or more data elements and a plurality of target 
datasets respectively containing one or more data elements, the method comprising the steps of: 

determining one or more collections of data elements from the plurality of target datasets that 
best match a collection of data elements from the source dataset, the determination being based on 
the structures associated with the source dataset and the plurality of target datasets; and 

computing one or more distance metrics between the one or more target collections and the 
source collection. 

25. The method of Claim 24, wherein the determining step fiirther comprises the steps of: 
generating one or more preliminary target collection descriptors associated with the one or 

more target collections by transforming a source collection descriptor associated with the source 
collection; and 

removing constraints associated with the one or more preliminary target collection 
descriptors until a non-null element collection is obtained. 

26. Apparatus for automatically navigating between data with dissimilar structures including 
a source dataset containing one or more data elements and a plurality of target datasets respectively 
containing one or more data elements, the apparatus comprising: 

means for determining one or more collections of data elements from the plurality of target 
datasets that best match a collection of data elements from the source dataset, the determination 
being based on the structures associated with the source dataset and the plurality of target datasets; 
and 

means for computing one or more distance metrics between the one or more target collections 
and the source collection. 
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27. The apparatus of Claim 26, wherein the determining means further comprises: 
means for generating one or more preHminary target collection descriptors associated with 

the one or more target collections by transforming a source collection descriptor associated with the 

source collection; and 

means for removing constraints associated with the one or more preliminary target collection 
descriptors until a non-null element collection is obtained. 
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